Feature subset selection for data and feature streams: a review

نویسندگان

چکیده

Abstract Real-world problems are commonly characterized by a high feature dimensionality, which hinders the modelling and descriptive analysis of data. However, some these data may be irrelevant or redundant for learning process. Different approaches can used to reduce this information, improving not only speed building models but also their performance interpretability. In review, we focus on subset selection (FSS) techniques, select original set without making any transformation attributes. Traditional batch FSS algorithms adequate efficiently handle large volumes data, either because memory arise received in sequential manner. Thus, article aims survey state art incremental algorithms, perform more under circumstances. strategies described, such as incrementally updating weights, applying information theory using rough set-based FSS, well multiple supervised unsupervised tasks where application is interesting.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

A New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum

Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such ...

متن کامل

a hybrid feature subset selection algorithm for analysis of high correlation proteomic data

pathological changes within an organ can be reflected as proteomic patterns in biological fluids such as plasma, serum, and urine. the surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (seldi-tof ms) has been used to generate proteomic profiles from biological fluids. mass spectrometry yields redundant noisy data that the most data points are irrelevant features ...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

Remainder Subset Awareness for Feature Subset Selection

Feature subset selection has become more and more a common topic of research. This popularity is partly due to the growth in the number of features and application domains. It is of the greatest importance to take the most of every evaluation of the inducer, which is normally the more costly part. In this paper, a technique is proposed that takes into account the inducer evaluation both in the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Artificial Intelligence Review

سال: 2023

ISSN: ['0269-2821', '1573-7462']

DOI: https://doi.org/10.1007/s10462-023-10546-9